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24  Abstract 

25  Here  we  report  the  genome  sequence  of  Yersinia  pestis  strain  Cadman,  an  attenuated  strain 

26  lacking  the  pgm  locus.  Y.  pestis  is  the  causative  agent  of  plague  and  generally  must  be  worked 

27  with  under  BSL-3  conditions.  However,  strains  lacking  the  pgm  locus  are  considered  safe  to 

28  work  with  under  BSL-2  conditions. 

29 

30  Genome  Announcement 

31  Yersinia  pestis  is  the  causative  agent  of  plague.  Due  to  its  high  potential  for  use  in  biological 

32  terrorism,  the  CDC  has  classified  it  a  category  A  pathogen,  which  must  be  handled  within  a 

33  biosafety  level  3  laboratory. 

34 

35  Here  we  report  the  complete  sequence  of  the  pigmentation  negative  (pgm-)  Cadman  strain  of  Y. 

36  pestis.  This  strain  was  isolated  in  1965  from  cerebrospinal  fluid  obtained  from  a  2l//2-ycar-old 

37  boy(l).  It  is  unknown  whether  the  original  isolate  was  pgm-  or  whether  this  mutation  occurred 

38  during  laboratory  passage.  However,  naturally  occurring  pgm-  isolates  have  been  associated  with 

39  disease  in  humans  and  rodents  (2).  The  virulence  of  pgm-  strains  of  Y.  pestis  is  attenuated  (3)  and 

40  therefore  they  can  be  manipulated  under  less  restrictive  laboratory  conditions. 

41 

42  Chromosomal  DNA  was  prepared  using  the  Qiagen  EZ1  Advanced  XL  system  with  the  EZ1 

43  DNA  Tissue  kit  according  to  manufacturer’s  instructions.  The  sequencing  library  was  prepared 

44  using  the  SMRTbell™  Template  Prep  Kit  (Pacific  Biosciences,  Menlo  Park,  CA)  following 

45  manufacturer’s  protocol.  The  resulting  SMRTbell  template  was  size  selected  on  BluePippin 

46  system  (Sage  Science,  Beverly,  MA)  using  0.75%  dye-free  agarose  cassette  with  4-10kb  Hi-Pass 
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47  protocol  and  low  cut  set  at  4  kb.  Size  selected  template  was  cleaned  and  concentrated  with 

48  AMPure  PB  beads.  The  P4  polymerase  was  used  with  the  C2  sequencing  kit  to  generate  240- 

49  minute  movies. 

50 

51  Raw  sequencing  reads  were  filtered  for  a  minimum  subread  length  of  lOOObp,  minimum 

52  polymerase  read  length  of  5000bp,  and  minimum  quality  of  0.80.  The  filtered  dataset  contained 

53  101,466  reads  with  a  mean  subread  length  of  8,409bp. 

54 

55  Reads  were  assembled  into  four  contigs  using  HGAP3  v2.3  (4).  These  contigs  correspond  to  the 

56  chromosome  and  plasmids  pCD  1 ,  pPCP  1 ,  and  pMT  1 .  Each  contig  was  checked  for  redundant 

57  sequence  at  the  ends  using  Gepard  vl  .3(5).  The  plasmids  were  detennined  to  be  complete 

58  circular  molecules,  so  redundant  sequences  at  the  ends  were  trimmed  to  one  copy  and  a  new 

59  breakpoint  was  chosen  for  their  linear  representations.  A  large  (~50kb)  repeat  on  the 

60  chromosome  was  not  able  to  be  resolved,  so  the  chromosome  was  not  modified.  Reads  were  then 

61  re-aligned  to  the  trimmed  and  shifted  draft  assembly  for  correction  using  Quiver  (4). 

62 

63  Annotation  with  NCBI’s  Prokaryotic  Genome  Annotation  Pipeline  v3.3  (6)  identified  4,238 

64  coding  regions,  19  ribosomal  RNAs,  68  tRNAs,  12  ncRNAs,  and  2  CRISPR  arrays. 

65  Gene  content  was  compared  to  Y.  pestis  C092  (NC_003 13 1,  NC_003 132,  NC_003 134, 

66  NC_003 143).  Nucleotide  sequences  of  annotated  genes  were  clustered  at  90%  identity  using 

67  USearch  (7).  Genes  without  assigned  orthologs  were  used  as  queries  in  a  BLAST  search  against 

68  both  genomes.  Genes  with  >90%  nucleotide  identity  to  a  region  of  the  C092  genome  were 

69  considered  to  be  present,  even  if  unannotated.  All  annotated  genes  outside  of  the  pgm  locus  in 
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70  C092  were  present  in  Cadman,  although  some  exhibit  alterations  that  may  result  in  loss  of 

71  function.  No  genes  present  in  Cadman  were  absent  in  C092.  Absence  of  the  pgm  locus  was 

72  confirmed  by  aligning  all  reads  to  the pgm+  C092  using  BWA-mem  v0.7. 12(8).  No  significant 

73  alignments  to  sequence  unique  to  the  pgm  locus  were  observed. 

74 

75  Nucleotide  sequence  accession  numbers.  Complete  sequences  for  all  molecules  have  been 

76  deposited  at  Genbank  under  the  accession  numbers  CPO 16273  to  CPO 16276. 

77 
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